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ABSTRACT 



Libraries of unimolecular, double- stranded oligonucleotides 
on a solid support. These libraries arc useful in pharmaceu- 
tical discovery for the screening of numerous biological 
samples for specific interactions between the double- 
stranded oligonucleotides, and peptides, proteins, drugs and 
RNA, In a related aspect, the present invention provides 
libraries of conformationally restricted probes on a solid 
support. The probes arc restricted in their movement and 
flexibility using double-stranded oHgonucleotides as scaf- 
folding. The probes arc also useful in various screening 
procedures associated with drug discovery and diagnosis. 
The present invention further provides methods for the 
preparation and screening of the above libraries. 

6 Claims, 1 Drawing Sheet 
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SURFACE-BOUND, UNIMOLECULAR, 
DOUBLE-STRANDED DNA 

GOVERNMENT RIGHTS 

Research leading to the invention was funded in part by 
NTH Grant No. R01HG00813-03 and the government may 
have certain fights to the invention. 

BACKGROUND OF THE INVENTION 
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The present invention relates to the field of polymer 
synthesis and the use of polymer libraries for biological 
screening. More specifically, in one embodiment the inven- 
tion provides arrays of diverse double-stranded oligonucle- 
otide sequences. In another embodiment, the invention pro- 
vides arrays of conformationally restricted probes, wherein 
the probes are held in position using double-stranded DNA 
sequences as scaffolding. Libraries of diverse unimolecular 
double-stranded nucleic acid sequences and probes may be 
used, for example, in screening studies for determination of 
binding affinity exhibited by binding proteins, drugs, or 
RNA. 

Methods of synthesizing desired single stranded DNA 
sequences are well known to those of skill in the art. In 
particular, methods of synthesizing oligonucleotides arc 
found in, for example, Oligonucleotide Synthesis: A Prac- 
tical Approach, Gait, ed., IRL Press, Oxford (1984), incor- 
porated herein by reference in its entirety for all purposes. 
Synthesizing unimolecular double-stranded DNA in solution 
has also been described. Sec, Durand, et ai. Nucleic Acids 
Res 18:6353-6359 (1990) and Thomson, et al. Nucleic 
Acids Res, 21:5600-5603 (1993). the disclosures of both 
being incorporated herein by reference. 

Solid phase synthesis of biological polymers has been 
evolving since the early "Merrifield" solid phase peptide 
synthesis, described in Merrifield, 7. Am, Chem, Soc. 
85:2149-2154 (1963), incorporated herein by reference for 
all purposes. Solid-phase synthesis techniques have been 
provided for the synthesis of several peptide sequences on, 
for example, a number of "pins." See e.g., Geysen et al.. J. 
Immun. Meth. 102:259-274 (1987), incorporated herein by 
reference for all purposes. Other solid-phase techniques 
involve, for example, synthesis of various peptide sequences 
on different cellulose disks supported in a column. See Frank 
and Doring, Tetrahedron 44:6031-6040 (1988), incorpo- 
rated herein by reference for all purposes. Still other solid- 
phase techniques arc described in U.S. Pat. No. 4,728,502 
issued to Hamill and WO 90/00626 (Beattie, inventor). 

Each of the above techniques produces only a relatively 
low density array of polymers. For example, the technique 
described in Geysen et al. is limited to producing 96 
different polymers on pins spaced in the dimensions of a 
standard microtiter plate. 

Improved methods of forming large arrays of oligonucle- 
otides, peptides and other polymer sequences in a short 
period of time have been devised. Of particular note, Piming 
et al., U.S. Pal. No. 5,143,854 (sec also PCT Application No. 
WO 90/15070) and Fodor et al., PCT Publication No. WO 
92/10092, all incorporated herein by reference, disclose 
methods of forming vast arrays of peptides, oligonucleotides 
and other polymer sequences using, for example, light- 
directed synthesis techniques. See also, Fodor et al.. Science, 
251:767-777 (1991), also incorporated herein by reference 
for all purposes. These procedures are now referred to as 
VLSIPS '"^ procedures. 



In the above-referenced Fodor et al., PCT application, an 
elegant method is described for using a computer-controlled 
system to direct a VLSIPS™ procedure. Using this 
approach, one heterogenous array of polymers is converted, 
through simultaneous coupling at a number of reaction sites, 
into a different heterogenous array. See, U.S. Pat. No. 
5,384.261 and U.S. application Sen No. 07/980,523, the 
disclosures of which are incorporated herein for all pur- 
poses. 

10 The development of VLSIPS™ technology as described 
in the above-noted U.S. Pat, No. 5,143,854 and PCT patent 
publication Nos. WO 90/15070 and 92/10092, is considered 
pioneering technology in the fields of combinatorial synthe- 
sis and screening of combinatorial libraries. More recenUy, 
15 patent application Ser. No. 08/082,937, filed Jun. 25, 1993 
now abandoned, describes methods for making arrays of 
oligonucleotide probes that can be used to check or deter- 
mine a partial or complete sequence of a target nucleic acid 
and to detect the presence of a nucleic acid containing a 
specific oligonucleotide sequence. 

A number of biochemical processes of pharmaceutical 
interest involve the interaction of some species, e.g., a drug, 
a peptide or protein, or RNA, with double-stranded DNA. 
For example, protein/DNA binding interactions are involved 
with a number of transcription factors as well as tumor 
suppression associated with the p53 protein and the genes 
contributing to a number of cancer conditions. 
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SUMMARY OF THE INVENTION 

High-density arrays of diverse unimolecular, double- 
stranded oligonucleotides, as well as arrays of conforma- 
tionally restricted probes and methods for their use are 
provided by virtue of the present invention. In addition, 
methods and devices for detecting duplex formation of 
oligonucleotides on an array of diverse single-stranded 
oligonucleotides arc also provided by this invention. Fur- 
ther, an adhesive based on the specific binding characteris- 
tics of two arrays of complementary oHgonucleotides is 
provided in the present invention. 

According to one aspect of the present invention, libraries 
of unimolecular, double-stranded oligonucleotides are pro- 
vided. Each member of the library is comprised of a solid 
support, an optional spacer for attaching the double-stranded 
oligonucleotide to the support and for providing sufficient 
space between the double-stranded oligonucleotide and the 
solid support for subsequent binding studies and assays, an 
oligonucleotide attached to tiie spacer and further attached to 
a second complementary oligonucleotide by means of a 
flexible linker, such that the two oligonucleotide portions 
exist in a double-stranded configuration. More particularly, 
the members of the libraries of the present invention can be 
represented by the formula: 

Y_L'— X'— L^— 

in which Y is a solid support, L* is a bond or a spacer, I? is 
a flexible linking group, and and are a pair of 
complementary oligonucleotides. 

In a specific aspect of the invention, the library of 
different unimolecular. double-stranded oligonucleotides 
can be used for screening a sample for a species which binds 
to one or more members of the library. 

In a related aspect of the invention, a library of different 
conformationally-restricted probes attached to a solid sup- 
port is provided. The individual members each have the 
formula: 
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in which X^* and X'^ are complcmcniary oligonucleotides 
and Z is a probe having sufficient length such that X^^ and 
X^^ form a double-stranded oligonucleotide portion of the ^ 
member and thereby restrict the conformations available to 
the probe. In a specific aspect of the invention, the library of 
different conformationaily-restricted probes can be used for 
screening a sample for a species which binds to one or more 
probes in the library: lO 

According to yet another aspect of the present invention, 
methods and devices for the bioelectronic detection of 
duplex formation arc provided. 

According to still another aspect of the invention, an 
adhesive is provided which comprises two surfaces of ^ 
complcmcniary oligonucleotides. 

BRIEF DESCRIPTION OF THE DRAWINGS 

FIGS. 1 A to IF illustrate the preparation of a member of 
a library of surface-bound, uni molecular doublc-strandcd 
DNA as well as binding studies with receptors having 
specificity for cither the double stranded DNA portion, a 
probe which is held in a conformationally restricted form by 
DNA scaffolding, or a bulge or loop region of RNA. ^ 

DESCRIPTION OF THE PREFERRED 
EMBODIMENT 

Abbreviations 

The following abbreviations arc used herein: phi, phenan- 30 
ihrcncquinonc diimine; phen', 5-amido-glutaric acid- 1,1 0- 
phcnanthroline; dppz, dipyridophenazinc. 
Glossary 

The following terms arc intended to have the following 
general meanings as they arc used herein: 35 

Chemical terms: As used herein, the term "alkyl" refers to 
a saturated hydrocarbon radical which may be straight-chain 
or branchcd-chain (for example, ethyl, isopropyl, t-amyl, or 
2,5-dimclhylhcxyl). When "alkyl" or '*alkylcnc" is used to 
refer to a linking group or a spacer, it is taken to be a group 40 
having two available valences for covalcnt attachment, for 
example, — CH2CH2 — , — CH2CH2CH2— , 

— CH2CH2CH(CH3)CH2— and — CH2(CH2CH2)2CH2— . 
Preferred aikyl groups as substitucnts arc those containing 1 
to 10 carbon atoms, with those containing I to 6 carbon 45 
atoms being particularly preferred. Prcrcrrcd alkyl or alky- 
Icne groups as Unking groups arc those containing 1 to 20 
carbon atoms, with those containing 3 to 6 carbon atoms 
being particularly preferred. The term "polyethylene glycol" 
is used to refer to those molecules which have repealing 50 
units of ethylene glycol, for example, hexaethylenc glycol 
(HO— (CH2CH20)3— CHjCHjOH). When the term "poly- 
ethylene glycol" is used to refer to linking groups and spacer 
groups, it would be understood by one of skill in the art that 
other polycthcrs or polyols could be used as well (i. e, 55 
polypropylene glycol or mixtures of ethylene and propylene 
glycols). 

The term "protecting group" as used herein, refers to any 
of the groups which arc designed to block one reactive site 
in a molecule while a chemical reaction is carried out at 60 
another reactive site. More particularly, the protecting 
groups used herein can be any of those groups described in 
Greene, el al., Protective Groups In Organic Chemistry^ 2nd 
Ed.. John Wiley & Sons, New York, N.Y. 1991 , incorporated 
herein by reference. The proper selection of protecting 65 
groups for a particular synthesis will be governed by the 
overall methods employed in the synthesis. For example, in 



"light-directed" synthesis, discussed below, the protecting 
groups will be photolabiie protecting groups such as NVOC. 
MeNPOC, and those disclosed in co-pending Application 
PCT/US93/10162 (filed Oct 22, 1993), incorporated herein 
by reference. In other methods, protecting groups may be 
removed by chemical methods and include groups such as 
FMOC. DMT and others known to those of skill in the art. 

Complementary or substantially complementary: Refers 
to the hybridization or base pairing between nucleotides or 
nucleic acids, such as, for instance, between the two strands 
of a double stranded DNA molecule or between an oligo- 
nucleotide primer and a primer binding site on a single 
stranded nucleic acid to be sequenced or amplified. Comple- 
mentary nucleotides arc, generally, A and T (or A and U). or 
C and G. Two single stranded RNA or DNA molecules are 
said to be substantially complementary when the nucleotides 
of one strand, optimally aligned and compared and with 
appropriate nucleotide insertions or deletions, pair with at 
least about 80% of the nucleotides of the other strand, 
usually at least about 90% to 95%, and more preferably from 
about 98 to 100%. 

Alternatively, substantial complementary exists when an 
RNA or DNA strand will hybridize under selective hybrid- 
ization conditions to its complement. Typically, selective 
hybridization will occur when there is at least about 65% 
complementary over a stretch of at least 14 to 25 nucle- 
otides, preferably at least about 75%, more preferably at 
least about 90% complementary. S. ec, M. Kanchisa A/wc/e/c 
Acids Res. 12:203 (1984). incorporated herein by reference. 

Stringent hybridization conditions will typically include 
salt concentrations of less than about IM, more usually less 
than about 500 mM and preferably less than about 200 mM. 
Hybridization temperatures can be as low as 5° C, but are 
typically greater than 22** C, more typically greater than 
about 30° C, and preferably in excess of about 3T C. 
Longer fragments may require higher hybridization tem- 
peratures for specific hybridization. As other factors may 
affect the stringency, of hybridization, including base com- 
position and length of the complcmcniary strands, presence 
of organic solvents and extent of base mismatching, the 
combination of parameters is more important than the abso- 
lute measure of any one alone. 

Epitope: The portion of an antigen molecule which is 
delineated by the area of interaction with the subclass of 
receptors known as antibodies. 

Identifier lag: A means whereby one can identify which 
molecules have experienced a particular reaction in the 
synthesis of an oligomer. The identifier tag also records the 
step in the synthesis series in which the molecules experi- 
enced thai particular monomer reaction. The identifier tag 
may be any recognizable feature which is, for example: 
microscopically distinguishable in shape, size, color, optical 
density, etc.; differently absorbing or emitting of light; 
chemically reactive; magnetically or electronically encoded; 
or in some other way distinctively marked with the required 
information. A preferred example of such an identifier tag is 
an oligonucleotide sequence. 

Ligand/Probe: A ligand is a molecule that is recognized by 
a particular receptor. The agent bound by or reacting with a 
receptor is called a "ligand." a term which is definitionally 
meaningful only in terms of its counterpart receptor. The 
term "ligand" docs not imply any particular molecular size 
or other structural or compositional feature other than that 
the substance in question is capable of binding or otherwise 
interacting with the receptor. Also, a ligand may serve either 
as the natural ligand to which the receptor binds, or as a 
functional analogue that may act as an agonist or antagonist. 
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Examples of ligands that can be investigated by this inven- 
tion include, but are not restricted to, agonists and antago- 
nists for cell membrane receptors, toxins and venoms, viral 
epitopes, hormones (e.g., opiates, steroids, etc.), hormone 
receptors, peptides, enzymes, enzyme substrates, substrate 
analogs, transition state analogs, cofactors, drugs, proteins, 
and antibodies. The term "probe" refers to those moleciJes 
which are expected to act like ligands but for which binding 
information is typically unknown. For example, if a receptor 
is known to bind a ligand which is a peptide P-tum, a 
"probe" or library of probes will be those molecules 
designed to mimic the peptide P-tum, In instances where the 
particular ligand associated with a given receptor is 
unknown, the term probe refers to those molecules designed 
as potential ligands for the receptor. 

Monomer: Any member of the set of molecules which can 
be joined together to form an oligomer or polymer. The set 
of monomers useful in the present invention includes, but is 
not restricted to, for the example of oligonucleotide synthe- 
sis, the set of nucleotides consisting of adenine, thymine, 
cytosine, guanine, and uridine (A, T, C, G, and U, respec- 
tively) and synthetic analogs thereof. As used herein, mono- 
mers refers to any member of a basis set for synthesis of an 
oligomer. Different basis sets of monomers may be used at 
successive steps in the synthesis of a polymer. 

Oligomer or Polymer: The oligomer or polymer 
sequences of the present invention are formed from the 
chemical or enzymatic addition of monomer subunits. Such 
oligomers include, for example, both linear, cyclic, and 
branched polymers of nucleic acids, polysaccharides, phos- 30 
pholipids, and peptides having cither a-, P-, or co-amino 
acids, heteropolymers in which a known drug is covalently 
bound to any of the above, polyurethanes, polyesters, poly- 
carbonates, polyureas. polyamides, poly ethyl eneimines, 
polyarylcnc sulfides, poiysiloxanes, polyimides, polyac- 
etates, or other polymers which will be readily apparent to 
one skilled in the art upon review of this disclosure. As used 
herein, the term oligomer or polymer is meant to include 
such molecules as p-lum mimelics, prostaglandins and ben- 
zodiazepines which can also be synthesized in a stepwise 
fashion on a solid support. 

Peptide: A peptide is an oligomer in which the monomers 
arc amino acids and which are joined together through 
amide bonds and alternatively referred to as a polypeptide. 
In the context of this specification it should be appreciated 
that when a-amino acids are used, they may be the L-oplical 
isomer or the D-optical isomer. Other amino acids which arc 
useful in the present invention include unnatural amino acids 
such a P-alaninc, phenylglycinc, homoarginine and the like. 
Peptides arc more than two amino acid monomers long, and 
often more than 20 amino acid monomers long. Standard 
abbreviations for amino acids are used (e.g., P for proline). 
These abbreviations are included in Stryer, Biochemistry, 
Third Ed., (1988), which is incorporated herein by reference 
for all purposes. 

OligonucleoUdes: An oligonucleotide is a single-stranded 
DNA or EINA molecule, typically prepared by synthetic 
means. Alternatively, namrally occurring oligonucleotides, 
or fragments thereof, may be isolated from their natural 
sources or purchased from commercial sources. Those oli- 
gonucleotides employed in the present invention will be 4 to 
100 nucleotides in length, preferably from 6 to 30 nucle- 
otides, although oligonucleotides of different length may be 
appropriate. Suitable oligonucleotides may be prepared by 
the phosphoramidite method described by Beaucage and 65 
Carruthers, Tetrahedron Utt., 22:1859-1862 (1981), or by 
the triester method according to Matteucci, et al., J. Am. 
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Chem. Soc, 103:3185 (1981), both incorporated herein by 
reference, or by other chemical methods using either a 
commercial automated oligonucleotide synthesizer or 
VLSIPS'*'^ technology (discussed in detail below). When 
oligonucleotides are referred to as "double-stranded," it is 
understood by those of skill in the art that a pair of 
oligonucleotides exist in a hydrogen-bonded, helical array 
typically associated with, for example, DNA. In addition to 
the 100% complementary form of double-stranded oligo- 
nucleotides, the term "double-stranded" as used herein is 
also meant to refer to those forms which include such 
structural features as bulges and loops, described more fixUy 
in such biochemistry texts as Stryer, Biochemistry, Third 
Ed„ (1988), previously incorporated herein by reference for 
all purposes. 

Receptor: A molecule that has an affinity for a given 
ligand or probe. Receptors may be naturally-occurring or 
manmade molecules. Also, tiiey can be employed in their 
unaltered natural or isolated state or as aggregates with other 
species. Receptors may be attached, covalently or nonco- 
valenUy, to a binding member, either directiy or via a 
specific binding substance. Examples of receptors which can 
be employed by this invention include, but are not restricted 
to, antibodies, cell membrane receptors, monoclonal anti- 
bodies and antisera reactive with specific antigenic deter- 
minants (such as on viruses, cells or other materials), drugs, 
polynucleotides, nucleic acids, peptides, cofactors, lectins, 
sugars, polysaccharides, cells, cellular membranes, and 
organelles. Receptors are sometimes referred to in the art as 
anti-ligands. As the term receptors is used herein, no difiFer- 
ence in meaning is intended. A "ligand-receptor pair" is 
formed when two molecules have combined through 
molecular recognition to form a complex. Other examples of 
receptors which can be investigated by this invention 
include but arc not restricted to: 

a) Microorganism receptors: Determination of ligands or 
probes that bind to receptors, such as specific u-ansport 
proteins or enzymes essential to survival of microor- 
ganisms, is useful in a new class of antibiotics. Of 
particular value would be antibiotics against opporm- 
nislic fungi, protozoa, and tiiose bacteria resistant to the 
antibiotics in current use. 

b) En/.ymcs: For instance, the binding site of enzymes 
such as the enzymes responsible for cleaving neu- 
rotransmitters. Determination of ligands or probes that 
bind to certain receptors, and tiius modulate the action 
of the enzymes that cleave the different neurotransmit- 
ters, is useful in the development of dnigs that can be 
used in the ircaimcnl of disorders of neurotransmission. 

c) Antibodies: For instance, the invention may be useful 
in investigating the ligand-binding site on the antibody 
molecule which combines with Uie epitope of an anti- 
gen of interest. Determining a sequence that mimics an 
antigenic epitope may lead to the development of 
vaccines of which the immunogen is based on one or 
more of such sequences, or lead to the development of 
related diagnostic agents or compounds useful in thera- 
peutic treatments such as for autoimmune diseases 
(e.g., by blocking the binding of the "self antibodies). 

d) Nucleic Acids: The invention may be useful in inves- 
tigating sequences of nucleic acids acting as binding 
sites for cellular proteins ("trans-acting factors"). Such 
sequences may include, e.g., U^scriplion factors, sup- 
pressors, enhancers or promoter sequences. 

e) Catalytic Polypeptides: Polymers, preferably polypep- 
tides, which are capable of promoting a chemical 
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reaction involving the conversion of one or niorc 
rcactants to one or more products. Such polypeptides 
generally include a binding site specific for at least one 
reactant or reaction intermediate and an active func- 
tionality proximate to the binding site, which function- 
ality is capable of chemically modifying the bound 
reactant. Catalytic polypeptides are described in, 
Lemcr, R.A. et al.. Science 252: 659 (1991). which is 
incorporated herein by reference. 
0 Hormone receptors: For instance, the receptors for 
insulin and growth hormone. Determination of the 
ligands which bind with high affinity to a receptor is 
useful in the development of, for example, an oral 
replacement of the daily injections which diabetics 
must take to relieve the symptoms of diabetes, and in 
the other case, a replacement for the scarce human 
growth hormone that can only be obtained from cadav- 
ers or by recombinant DNA technology. Other 
examples arc the vasoconstrictive hormone receptors; 
determination of those ligands that bind to a rcceptor 
may lead to the development of drugs to control blood 
pressure. 

g) Opiate receptors: Determination of ligands that bind to 
the opiate receptors in the brain is useful in the devel- 
opment of less-addictive replacements for morphine 
and related drugs. 
Substrate or Solid Support: A material having a rigid or 
semi-rigid surface. Such materials will preferably lake the 
form of plates or slides, small beads, pellets, disks or other 
convenient forms, although other forms may be used. In 
some embodiments, at least one surface of the substrate will 
be substantially flat. In other embodimenls, a roughly spheri- 
cal shape is preferred. 

Synthetic: Produced by in vitro chemical or enzymatic 
synthesis. The synthetic libraries of the present invention 
may be contrasted with those in viral or plasmid vectors, for 
instance, which may be propagated in bacterial, yeast, or 
other living hosts. 

DESCRIPTION OF THE INVENTION 

The broad concept of the present invention is illustrated in 
FIGS. lA to IF. FIGS. lA, IB and IC illustrate the prepa- 
ration of surface-bound unimolccular double stranded DNA, 
while FIGS. ID, IE, and IF illustrate uses for the libraries 
of the present invention. 

FIG. 1 A shows a solid support 1 having an attached spacer 
2, which is optional. Attached to the distal end of the spacer 
is a first oligomer 3, which can be attached as a single unit 
or synthesi/xd on the support or spacer in a monomer by 
monomer approach. FIG. IB shows a subsequent stage in 
the preparation of one member of a library according to the 
present invention. In this stage, a flexible linker 4 is attached 
to the distal end of the oligomer 3. In other embodiments, the 
flexible linker will be a probe. FIG. IC shows the completed 
surface-bound unimolccular double stranded DNA which is 
one member of a library, wherein a second oligomer 5 is now 
attached to the distal end of the flexible linker (or probe). As 
shown in FIG. IC, the length of the flexible linker (or probe) 60 
4 is sufficient such that the first and second oligomers (which 
arc complementary) exist in a double- stranded conforma- 
tion. It will be appreciated by one of skill in the art, that the 
libraries of the present invention will contain multiple, 
individiially synthcsizxd members which can be screened for 
various types of activity. Three such binding events are 
illustrated in FIGS. 1 D, IE and IF 
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In FIG. ID, a receptor 6, which can be a protein, RNA 
molecule or other molecule which is known to bind to DNA, 
is introduced to the library. Determining which member of 
a library binds to the receptor provides information which is 
useful for diagnosing diseases, sequencing DNA or RNA. 
identifying genetic characteristics, or in drug discovery. 

In FIG. IE, the linker 4 is a probe for which binding 
information is sought. The probe is held in a conformation- 
ally restricted manner by the flanking oligomers 3 and 5, 
which arc present in a double-stranded conformation. As a 
result, a library of conformationally restricted probes can be 
screened for binding activity with a receptor 7 which has 
specificity for the probe. 

The present invention also contemplates the preparation 
of libraries of unimolccular, double-stranded oligonucle- 
otides having bulges or loops in one of the strands as 
depicted in FIG. IE In HG. IF, one oligonucleotide 5 is 
shown as having a bulge 8. Specific RNA bulges are often 
recognized by proteins (e.g., TAR RNA is recognized by the 
TAT protein of HIV). Accordingly, libraries of RNA biilgcs 
or loops arc useful in a number of diagnostic applications. 
One of skill in the art will appreciate that the bulge or loop 
can be present in either oligonucleotide portion 3 or 5. 
Libraries of Unimolccular, Double-Stranded Oligonucle- 
otides 

In one aspect, the present invention provides libraries of 
unimolecular double-stranded oligonucleotides, each mem- 
ber of the library having the formula: 

in which Y represents a solid support, X' and represent 
a pair of complementary oligonucleotides, represents a 
bond or a spacer, and represents a linking group having 
sufficient length such that X' and X^ form a double-stranded 
oligonucleotide. 

The solid support may be biological, nonbiological. 
organic, inorganic, or a combination of any of these, existing 
as panicles, su-ands, precipitates, gels, sheets, tubing, 
spheres, containers, capillaries, pads, slices, films, plates, 
slides, etc. The solid support is preferably flat but may take 
on alternative surface configurations. For example, the solid 
support may contain raised or depressed regions on which 
synthesis takes place. In some embodiments, the solid 
support will be chosen to provide appropriate light-absorb- 
ing characteristics. For example, the support may be a 
polymerized Langmuir Blodgclt film, funciionalizcd glass. 
Si, Gc, GaAs, GaP, SiO^, SiN^, modified silicon, or any one 
of a variety of gels or polymers such as (poly)lclranuoro- 
clhylene, (poly)vinylidcndifluoridc, polystyrene, polycar- 
bonate, or combinations thereof. Other suitable solid support 
materials will be readily apparent to those of skill in the art. 
Preferably, the surface of the solid support will contain 
reactive groups, which could be carboxyl, amino, hydroxyl, 
thiol, or the like. More preferably, the surface will be 
optically transparent and will have surface Si— OH func- 
tionalities, such as arc found on silica surfaces. ^ 

Attached to the solid support is an optional spacer, L . The 
spacer molecules arc preferably of sufficient length to permit 
the double-stranded oligonucleotides in the completed mem- 
ber of the library to interact freely with molecules exposed 
to the library. The spacer molecules, when present, arc 
typically6-50 atoms long to provide sufficient exposure for 
the attached double-stranded DNA molecule. The spacer, , 
is comprised of a surface attaching poition and a longer 
chain portion. The surface attaching portion is that part of L* 
which is directly attached to the solid support. This portion 
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can be attached to the solid support via carbon-carbon bonds 
using, for example, supports having (poly)trifluorochloro- 
elhylene surfaces, or preferably, by siloxane bonds (using, 
for example, glass or silicon oxide as the solid support). 
Siloxane bonds with the surface of the support are formed in 
one embodiment via reactions of surface attaching portions 
bearing trichlorosilyl or trialkoxysilyl groups. The surface 
attaching groups will also have a site for attachment of Uie 
longer chain portion. For example, groups which are suitable 
for attachment to a longer chain porUon would include 
amines, hydroxyl. thiol, and carboxyl. Preferred surface 
attaching portions include aminoalkylsilanes and hydroxy- 
alkylsilanes. In particularly preferred embodiments, the sur- 
face attaching portion of is either bis(2-hydroxyethyl)- 
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aminopropyltriethoxysilane, 

2-hydroxyethylaminopropyUrielhoxysilane, ammopropyltn- 
cthoxysilanc or hydroxypropyltriethoxysilane. 

The longer chain portion can be any of a variety ot 
molecules which are inert to the subsequent conditions for 
polymer synthesis. These longer chain portions will typi- 
cally be aryl acetylene, ethylene glycol oligomers contaimng 
2-14 monomer units, diamines, diacids, amino acids, pep- 
tides or combinations thereof. In some embodiments, the 
longer chain portion is a polynucleotide. The longer chain 
portion which is to be used as part of can be selected 25 
based upon its hydrophilicAiydrophobic properties to 
improve presentation of the double-stranded ohgonucle- 
otides to certain receptors, proteins or dnigs. The longer 
chain portion of can be constructed of polyethylcncgly- 
cols polynucleotides, alkylene, polyalcohol, polyester, 
polyamine, polyphosphodi ester and combinations thereof. 
Additionally, for use in synthesis of the libraries of the 
invention, L' will typically have a protecting group, attached 
to a functional group (i.e., hydroxyl, amino or carboxylic 
acid) on the distal or terminal end of the chain portion 
(opposite the solid support). After deprotection and cou- 
pling, the distal end is covalently bound to an oligomer^^ 
Attached to the distal end of is an oligonucleotide, X , 
which is a single-stranded DNA or RNA molecule. The 
oligonucleotides which are part of the present invention are 
typically of from about 4 to about 100 nucleotides in length. 
Preferably, X' is an oligonucleotide which is about 6 to 
about 30 nucleotides in length. The oligonucleotide is typi- 
cally linked to via the 3'-hydroxyl group of the oligo- 
nuclcoudc and a functional group on L» which results in the 45 
formation of an ether, ester, carbamate or phosphate ester 

linkage. . ^. ^ . .2 

Attached to the distal end of X* is a linking group, L , 
which is flexible and of sufficient length that X can cffec- 
lively hybridize with X^. The length of the linker will 
typically be a length which is at least the length spanned by 
two nucleoUdc monomers, and preferably at least four 
nucleotide monomers, while not be so long as to interfere 
with cither the pairing of X' and X= or any subsequent 
assays. The linking group itself will typically be an alkylene 
group (of from about 6 to about 24 carbons in length), a 
polycthyleneglycol group (of from about 2 to about 24 
ethylencglycol monomers in a linear configuration), a poly- 
alcohol group, a polyamine group (e.g., spermine, spermi- 
dine and polymeric derivatives thereoO. a polyester group 
(eg. poly (ethyl aery late) having of from 3 to 15 ethyl 
acrylate monomers in a linear configuration), a polyphos- 
phodiester group, or a polynucleotide (having from about 2 
to about 12 nucleic acids). Preferably, the linking group will 
be a polycthyleneglycol group which is at least a tetraeth- 65 
ylcncglycol, and more preferably, from about 1 to 4 hexa- 
ethyleneglycols linked in a linear array. For use in synthesis 



of the compounds of the invenUon, the linking group will be 
provided with functional groups which can be suitab y 
protected or activated. The linking group will be covalenUjj 
attached to each of the complementary oligonucleotides, X 
and X^ by means of an ether, ester, carbamate, phosphate 
ester or amine linkage. The flexible linking group L will be 
attached to the 5'-hydroxyl of the terminal monomer^of X 
and to the 3'-hydroxyl of the initial monomer . of X . Pre- 
ferred linkages are phosphate ester linkages which can be 
formed in the same manner as the oligonucleodde linkages 
which are present in X* and X^. For example, hexaethyl- 
eneelycol can be protected on one terminus with a photo- 
labile protecting group (i.e., NVOC or MeNPOC) and 
activated on the other terminus with 2-cyanoethyl-N.N- 
diisopropylamino-chlorophosphite to form a phosphoramid- 
ite. This linking group can then be used for construction of 
the libraries in the same manner as the photolabile-protected, 
phosphoramidite-acUvated nucleotides. Alternatively, ^ester 
linkages to X^ and X^ can be formed when the L has 
terminal carboxylic acid moieties (using the 5'-hydroxyl of 
X^ and the 3'-hydroxyl of X^). Other methods of forrmng 
ether, carbamate or amine linkages are known to those of 
skill In the art and particular reagents and references can be 
found in such texts as March. Advanced Organic Chemistry. 
4th Ed., Wiley-Interscience, New York, N.Y, 1992, incor- 
porated herein by reference. 

The oligonucleotide. X^, which is covalenUy attached to 
the distal end of the linking group is, like X\ a single- 
su-anded DNA or RNA molecule. The oligonucleotides 
which are part of the present invention are typically of from 
about 4 to about 100 nucleotides in length. Preferably, X is 
an oligonucleotide which is about 6 to about 30 nucleoudes 
in length and exhibits complementary to X' of from 90 to 
100%. More preferably, X* and X^ arc 100% complemen- 
tary. In one group of embodiments, either X or X will 
further comprise a bulge or loop portion and exhibit comple- 
mentary of from 90 to 100% over the remainder of the 
oligonucleotide. . 

In a particularly preferred embodiment, the solid support 
is a silica support, the spacer is a polycthyleneglycol con- 
jugated to an aminoalkylsilanc. the linking group is a 
polycthyleneglycol group, and X* and X^ are complcmen-^ 
tary oligonucleotides each comprising of from 6 to 30 
nucleic acid monomers. 

The library can have virtually any number of different 
members, and will be limited only by the number or variety 
of compounds desired to be screened in a given application 
and by the synthetic capabilities of the practitioner. In one 
group of embodiments, the library will have from 2 up to 
100 members. In other groups of embodiments, the library 
will have between 100 and 10000 members, and between 
10000 and 1000000 members, preferably on a solid support. 
In preferred embodiments, the library will have a density of 
more than 100 members at known locations per cm . pref- 
erably more than 1000 per cm^ more preferably more than 
10,000 per cm^. 

Libraries of Conformational ly Restricted Probes 

In still another aspect, the present invention provides 
libraries of conformational ly -restricted probes. Each of die 
members of the library comprises a solid support havmg an 
optional spacer which is attached to an oligomer of the 
formula; 
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in which X*^ and X*^ arc complementary oligonucleotides 
and Z is a probe. The probe will have sufficient length such 
that X^' and X'^ form a double-su^ded DNA portion of 
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each member. X^^ and X'^ are as described above for X^ and 
X^ respectively, except that for the present aspect of the 
invention, each member of the probe library can have the 
same X^^ and the same X^^, and differ only in the probe 
portion. In one group of embodiments, X" and X^^ are 5 
cither a poly-A oligonucleotide or a poly-T oligonucleotide. 

As noted above, each member of the library will typically 
have a different probe portion. The probes, Z, can be any of 
a variety of structures for which receptor-probe binding 
information is sought for conformationally-restricted forms. 10 
For example, the probe can be an agonist or antagonist for 
a cell membrane receptor, a toxin, venom, vital epitope, 
hormone, peptide, enzyme, collector, drug, protein or anti- 
body. In one group of embodiments, the probes are different 
peptides, each having of from about 4 to about 12 amino 15 
acids. Preferably the probes will be linked via polyphos- 
phate dicsiers, although other linkages are also suitable. For 
example, the last monomer employed on the X^* chain can 
be a 5'-aminopropyl-functionalized phosphoramidiie nucle- 
otide (available from Glen Research, Steriing, Va., USA or 20 
Gcnosys Biotechnologies, The Woodlands, Tex., USA) 
which will provide a synthesis initiation site for the carboxy 
to amino synthesis of the peptide probe. Once the peptide 
probe is formed, a 3'-succinylated nucleoside (from Cru- 
achem. Sterling, Va., USA) will be added under peptide 25 
coupling conditions. In yet another group of embodiments, 
the probes will be oligonucleotides of from 4 to about 30 
nucleic acid monomers which will form a DNA or RNA 
hairpin structure. For use in synthesis, the probes can also 
have associated functional groups (i.e., hydroxyl, amino, 30 
carboxylic acid, anhydride and derivatives thereoO for 
attaching two positions on the probe to each of the comple- 
mentary oligonucleotides. 

The surface of the solid support is preferably provided 
with a spacer molecule, although it will be understood that 35 
the spacer molecules arc not elements of this aspect of the 
invention. Where present, the spacer molecules will be as 
described above for 

The libraries of con form ationally restricted probes can 
also have virtually any number of members. As above, the 40 
number of members will be limited only by design of the 
particular screening assay for which the library will be used, 
and by the synthetic capabilities of the practitioner. In one 
group of embodiments, the library will have from 2 to 100 
members. In other groups of embodiments, the library will 45 
have between 100 and 10000 members, and between 10000 
and 1000000 members. Also as above, in preferred embodi- 
ments, the library will have a density of more than 100 
members ai known locations per cm^, preferably more than 
1000 per cm^, more preferably more than 10,000 per cm^. 50 
Preparation of the Libraries 

The present invention further provides methods for the 
preparation of diverse unimolccular, double-stranded oligo- 
nucleotides on a solid support. In one group of embodi- 
ments, the surface of a solid support has a plurality of 55 
preselected regions. An oligonucleotide of from 6 to 30 
monomers is formed on each of the preselected regions. A 
Unking group is then attached to the distal end of each of the 
oligonucleotides. Finally, a second oligonucleotide is 
formed on the distal end of each linking group such that the 60 
second oligonucleotide is complementary to the oligonucle- 
otide already present in the same preselected region. The 
linking group used will have sufEcient length such that the 
complementary oligonucleotides form a unimolccular, 
double-stranded oligonucleotide. In another group of 65 
embodiments, each chemically distinct member of the 
library will be synthesized on a separate solid support. 



Libraries on a Single Substrate 
Light-Directed Methods 

For those embodiments using a single solid support, the 
oligonucleotides of the present invention can be formed 
using a variety of techniques known to those skilled in the 
art of polymer synthesis on solid supports. For example, 
"light directed" methods (which are one technique in a 
family of methods known as VLSIPS''^ methods) are 
described in U.S. Pat. No. 5,143,854, previously incorpo- 
rated by reference. The light directed methods discussed in 
the *854 patent involve activating predefined regions of a 
substrate or solid support and then contacting the substrate 
with a preselected monomer solution. The predefined 
regions can be activated with a light source, typically shown 
through a mask (much in the manner of photolithography 
techniques used in integrated circuit fabrication). Other 
regions of the substrate remain inactive because they arc 
blocked by the mask from illumination and remain chemi- 
cally protected. Thus, a light pattern defines which regions 
of the substrate react with a given monomer. By repeatedly 
activating different sets of predefined regions and contacting 
different monomer solutions with the substrate, a diverse 
array of polymers is produced on the substrate. Of course, 
other steps such as washing unreaclcd monomer solution 
from the substrate can be used as necessary. Other tech- 
niques include mechanical techniques such as those 
described in PCT No. 92/10183, U.S. Pat. No. 5,384,261 
also incorporated herein by reference for all purposes. Still 
further techniques include bead based techniques such as 
those described in PCT US/93/04145, also incorporated 
herein by reference, and pin based methods such as those 
described in U.S, Pal. No. 5,288,514, also incorporated 
herein by reference. 

The VLSIPS™ methods arc preferred for making the 
compounds and libraries of the present invention. The 
surface of a solid support, optionally modified with spacers 
having phoiolabilc protecting groups such as NVOC and 
McNPOC, is illuminated through a photolithographic mask, 
yielding reactive groups (typically hydroxyl groups) in the 
illuminated regions. A 3*-0-phosphoramidite activated 
dcoxynuclcosidc (protected at the 5'-hydroxyl with a pho- 
iolabilc protecting group) is then presented to the surface 
and chemical coupling occurs at sites that were exposed to 
light. Following capping, and oxidation, the substrate is 
rinsed and the surface illuminated through a second mask, to 
expose additional hydroxyl groups for coupling. A second 
5'-protcctcd. 3"-0-phosphoramiditc activated dcoxynuclco- 
sidc is presented to the surface. The selective photodcpro- 
tcciion and coupling cycles arc repeated until the desired set 
of oligonucleotides is produced. Alicmalively, an oligomer 
of from, for example, 4 to 30 nucleotides can be added to 
each of the preselected regions rather than synthesize each 
member in a monomer by monomer approach. At this point 
in ihc synthesis, cither a ficxible linking group or a probe can 
be attached in a similar manner. For example, a flexible 
linking group such as polyethylene glycol will typically 
having an activating group (i.e., a phosphoramidiie) on one 
end and a photolabile protecting group attached to the other 
end. Suitably dcrivatizcd polyethylene glycol linking groups 
can be prepared by the methods described in Durand, ct al. 
Nucleic Acids Res. 18:6353-6359 (1990). Briefly, a poly- 
ethylene glycol (i.e., hexaethylcnc glycol) can be mono- 
protected using MeNPOC-chloride, Following purification 
of the mono-protected glycol, the remaining hydroxy moiety 
can be aciivatcd with 2-cyanocthyl-N,N-diisopropylami- 
nochlorophosphiic. Once the flexible linking group has been 
attached lo the first oligonucleotide (X*), deproiection and 
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coupling cycles will proceed using 5'-proiected, 3*-0-phos- 
phoramidite activated deoxynucleosides or intact oligomers. 
Probes can be attached in a manner similar to that used for 
the flexible linking group. When the desired probe is itself 
an oligomer, it can be formed either in stepwise fashion on 
the immobilized oligonucleotide or it can be separately 
synthesized and coupled to the immobilized oligomer in a 
single .step. For example, preparation of conformationally 
restricted P-lum mimetics will typically involve synthesis of 
an oligonucleotide as described above, in which the last 
nucleoside monomer will be derivatized with an aminoalkyl- 
functionalized phosphoramidite. See. U.S. Pat. No. 5,288, 
514, previously incorporated by reference. The desired 
peptide probe is typically formed in the direction from 
carboxyl to amine terminus. Subsequent coupling of a 
3'-succinylatcd nucleoside, for example, provides the first 
monomer in the consUnction of the complementary oligo- 
nucleotide strand (which is carried out by the above meth- 
ods). Alternatively, a library of probes can be prepared by 
first derivalizing a solid support with multiple poly(A) or 
poly(T) oligonucleotides which are suitably protected with 
photolabilc protecting groups, deprotecting at known sites 
and constructing the probe at those sites, then coupling the 
complementary poly(T) or poly(A) oligonucleotide. 
Flow Channel or Spotting Methods 
Additional methods applicable to library synthesis on a 
single subsu-ate are described in co-pending applications 
Sen No. 07/980,523. filed Nov. 20, 1992, and U.S. Pat, No. 
5,384,261, incorporated herein by reference for all purposes. 
In the methods disclosed in these applications, reagents are 30 
delivered to the subsu^te by either (1) fiowing within a 
channel defined on predefined regions or (2) "spotting" on 
predefined regions. However, other approaches, as well as 
combinations of spotting and flowing, may be employed. In 
each instance, certain activated regions of the substrate are 
mechanically separated from other regions when the mono- 
mer solutions are delivered to the various reaction sites. 

A typical "flow channel" method applied to the com- 
pounds and libraries of the present invention can generally 
be described as follows. Diverse polymer sequences are 
synthesized at selected regions of a substrate or solid support 
by forming flow channels on a surface of the substrate 
through which appropriate reagents flow or in which appro- 
priate reagents arc placed. For example, assume a monomer 
"A" is to be bound to the subsU-ate in a first group of selected 45 
regions. If necessary, all or part of the surface of the 
substrate in all or a part of the selected regions is activated 
for binding by, for example, flowing appropriate reagents 
through all or some of the channels, or by washing the entire 
substrate with appropriate reagents. After placement of a 
channel block on the surface of the substrate, a reagent 
having the monomer A flows through or is placed in all or 
some of the channel(s). The channels provide fluid contact 
to the first selected regions, thereby binding the monomer A 
on the substrate directly or indirectly (via a spacer) in the 
first selected regions. 

Thereafter, a monomer B is coupled to second selected 
regions, some of which may be included among the first 
selected regions. The second selected regions will be in fluid 
contact with a second flow channel(s) through translation, 
rotation, or replacement of the channel block on the surface 
of the substrate; through opening or closing a selected valve; 
or through deposition of a layer of chemical or photoresist. 
If necessary, a step is performed for activating at least the 
second regions. Thereafter, the monomer B is flowed 
through or placed in the second flow channel(s). binding 
monomer B at the second selected locations. In this particu- 
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lar example, the resulting sequences bound to the substrate 
at this stage of processing will be, for example, A, B, and 
AB. The process is repeated to form a vast array of 
sequences of desired length at known locations on the 
substrate. 

After the substrate is activated, monomer A can be flowed 
through some of the channels, monomer B can be flowed 
through other channels, a monomer C can be flowed through 
still other channels, etc. In this manner, many or all of the 
reaction regions are reacted with a monomer before the 
channel block must be moved or the substrate must be 
washed and/or reactivated. By making use of many or all of 
die available reaction regions simultaneously, the number of 
washing and activation steps can be minimized. 

One of skill in the art will recognize that there are 
alternative methods of forming channels or otherwise pro- 
tecting a portion of the surface of the substrate. For example, 
according to some embodiments, a protective coating such 
as a hydrophilic or hydrophobic coating (depending upon 
the nature of the solvent) is utilized over portions of the 
substrate to be protected, sometimes in combination with 
materials that facilitate wetting by the reactant solution in 
other regions. In this manner, the fiowing solutions are 
further prevented from passing outside of their designated 
flow paths. 

The "spotting" methods of preparing compounds arid 
libraries of the present invention can be implemented in 
much the same manner as the flow channel methods. For 
example, a monomer A can be delivered to and coupled with 
a first group of reaction regions which have been appropri- 
ately activated. Thereafter, a monomer B can be delivered to 
and reacted with a second group of activated reaction 
regions. Unlike the flow channel embodiments described 
above, rcactants are delivered by directly depositing (rather 
than flowing) relatively small quantities of them in selected 
regions. In some steps, of course, the entire substrate surface 
can be sprayed or otherwise coaled with a solution. In 
preferred embodiments, a dispenser moves from region to 
region, depositing only as much monomer as necessary at 
each stop. Typical dispensers include a micropipctlc to 
deliver the monomer solution to the substrate and a robotic 
system to conU-ol the position of the micropipctte with 
respect to the subsUatc, or an ink-jet printer. In other 
embodiments, the dispenser includes a scries of tubes, a 
manifold, an array of pipettes, or the like so that various 
reagents can be delivered to the reaction regions simulta- 
neously. 

Pin-Based Methods 

Another method which is useful for the preparation of 
compounds and libraries of the present invention involves 
"pin based synthesis." This method is described in detail in 
U.S. Pat. No. 5,288,514, previously incorporated herein by 
reference. The method utilizes a substrate having a plurality 
of pins or other extensions. The pins are each inserted 
simultaneously into individual reagent containers in a tray. 
In a common embodiment, an array of 96 pins/containers is 
utilized. 

Each tray is filled with a particular reagent for coupling in 
a particular chemical reaction on an individual pin. Accord- 
ingly, the trays will often contain different reagents. Since 
the chemistry disclosed herein has been established such that 
a relatively similar set of reaction conditions may be utilized 
to perform each of the reactions, it becomes possible to 
conduct multiple chemical coupling steps simultaneously. In 
the first step of the process the invention provides for the use 
of substrate(s) on which the chemical coupling steps are 
conducted. The substrate is optionally provided with a 



50 



55 



60 



65 



5,556,752 



IS 



16 



spacer having active sites. In the particular case of oligo- 
nucleotides, for example, the spacer may be selected from a 
wide variety of molecules which can be used in organic 
environments associated with synthesis as well as aqueous 
environments associated with binding studies. Examples of 5 
suitable spacers arc polyethylcneglycols, dicarboxylic acids, 
polyamines and alkylenes, substituted with, for example, 
mcthoxy and cthoxy groups. Additionally, the spacers will 
have an active site on the distal end. The active sites are 
optionally protected initially by protecting groups. Among a lo 
wide variety of protecting groups which are useful are 
FiMOC, BOC, t-butyl esters, t-butyl ethers, and the like. 
Various exemplary protecting groups arc described in, for 
example, Atherlon et al., Solid Phase Peptide Synthesis, IRL 
Press (1989), incorporated herein by reference. In some 15 
embodiments, the spacer may provide for a cleavable func- 
tion by way of, for example, exposure to acid or base. 
Libraries on Multiple Substrates 
Bead Based Methods 

Yet another method which is useful for synthesis of 20 
compounds and libraries of the present invention involves 
"bead based synthesis." A general approach for bead based 
synthesis is described copending application Ser. Nos. 
07/762,522 (filed Sep. 18, 1991 now abandoned); 07/946, 
239 (filed Sep. 16, 1992); 08/146,886 (filed Nov. 2, 1993); 25 
07/876,792 (filed Apr. 29, 1992) and PCT/US 93/04 145 
(filed Apr. 28, 1993), the disclosures of which are incorpo- 
rated herein by reference. 

For the synthesis of molecules such as oligonucleotides 
on beads, a large plurality of beads arc suspended in a 30 
suitable carrier (such as water) in a container. The beads arc 
provided with optional spacer molecules having an active 
site. The active site is protected by an optional protecting 
group. 

In a first step of the synthesis, the beads are divided for 35 
coupling into a plurality of containers. For the purposes of 
this brief description, the number of comaincrs will be 
limited to three, and the monomers denoted as A, B, C, D, 
E. and F. The protecting groups are then removed and a first 
portion of the molecule to be synthesized is added to each of 40 
the three containers (i. c., A is added to container 1, B is 
added to container 2 and C is added to container 3). 

Thereafter, the various beads are appropriately washed of 
excess reagents, and remixed in one container. Again, it will 
be recognized that by virtue of the large number of beads 45 
utilized al the outset, there will similarly be a large number 
of beads randomly dispersed in the container, each having a 
particular first portion of the monomer to be synthesized on 
a surface thereof. 

Thereafter, the various beads are again divided for cou- 50 
pling in another group of three containers. The beads in the 
first container are deproiecied and exposed to a second 
monomer (D), while the beads in the second and third 
containers are coupled to molecule portions E and F respec- 
tively. Accordingly, molecules AD, BD, and CD will be 55 
present in the first container, while AE, BE, and CE will be 
present in the second container, and molecules AF, BF, and 
CF will be present in the third container. Each bead, how- 
ever, will have only a single type of molecule on its surface. 
Thus, all of the possible molecules formed from the first 60 
portions A, B, C, and the second portions D, E, and F have 
been formed. 

The beads are then recombined into one container and 
additional steps such as are conducted to complete the 
synthesis of the polymer molecules. In a preferred embodi- 65 
mcnt, the beads arc tagged with an identifying tag which is 
unique to the particular double-stranded oligonucleotide or 



probe which is present on each bead. A complete description 
of identifier lags for use in synthetic libraries is provided in 
co-pending appUcadon Ser. No, 08/146,886 (filed Nov. 2, 
1993) previously incorporated by reference for all purposes. 
Methods of Library Screening 

A library prepared according to any of the methods 
described above can be used to screen for receptors having 
high affinity for either unimolecular, double-stranded oligo- 
nucleotides or conformationally restricted probes. In one 
group of embodiments, a solution containing a marked 
(labelled) receptor is introduced to the library and incubated 
for a suitable period of time. The library is then washed free 
of unbound receptor and the probes or double-stranded 
oligonucleotides having high affinity for the receptor are 
identified by identifying those regions on the surface of the 
library where markers arc located. Suitable markers include, 
but are not limited to, radiolabels, chromophores, fluoro- 
phores, chemiluminescent moieties, and transition metals. 
Alternatively, the presence of receptors may be detected 
using a variety of other techniques, such as an assay with a 
labelled enzyme, antibody, and the like. Other techniques 
using various marker systems for delecting bound receptor 
will be readily apparent to those skilled in the art 

In a preferred embodiment, a library prepared on a single 
solid support (using, for example, the VLSIPS™ technique) 
can be exposed to a solution containing marked receptor 
such as a marked antibody. The receptor can be marked in 
any of a variety of ways, but in one embodiment marking is 
effected with a radioactive label. The marked antibody binds 
with high affinity to an immobilized antigen previously 
localized on the surface. After washing the surface free of 
unbound receptor, the surface is placed proximate to x-ray 
film or phosphorimagcrs lo identify the antigens that arc 
recognized by ihc antibody. Alternatively, a fluorescent 
marker may be provided and detection may be by way of a 
charge-coupled device (CCD), fluorescence microscopy or 
laser scanning. 

When autoradiography is the detection method used, the 
marker is a radioactive label, such as ^^P. The marker on the 
surface is exposed to X-ray film or a phosphorimager, which 
is developed and read out on a scanner. An exposure time of 
about I hour is typical in one embodiment. Fluorescence 
detection using a fluorophorc label, such as fluorescein, 
auachcd to the receptor will usually require shorter exposure 
limes. 

Quantitative assays for receptor concentrations can also 
be performed according to the present invention. In a direct 
assay method, the surface containing localized probes pre- 
pared as described above, is incubated with a solution 
containing a marked receptor for a suitable period of time. 
The surface is then washed free of unbound receptor. The 
amount of marker present at predefined regions of the 
surface is then measured and can be related to the amount of 
receptor in solution. Methods and conditions for performing 
such assays arc well-known and are presented in, for 
example, L. Hood ct al., Immunology, Bcnjamin/Cummings 
(1978), and E. Harlow et al., Antibodies. A Laboratory 
Manual, Cold Spring Harbor Laboratory, (1988). See. also 
U.S. Pat. No. 4,376.110 for methods of performing sandwich 
assays. The precise conditions for performing these steps 
will be apparent to one skilled in the art. 

A competitive assay method for two receptors can also be 
employed using the present invention. Methods of conduct- 
ing competitive assays are known to those of skill in the art. 
One such method involves immobilizing conformationally 
restricted probes on predefined regions of a surface as 
described above. An unmarked first receptor is then bound 



5.556,752 



17 



18 



10 



to the probes on the surface having a known specific binding 
affinity for the receptors. A solution containing a marked 
second receptor is then introduced to the surface and incu- 
bated for a suitable time. The surface is then washed free of 
unbound reagents and the amount of marker remaining on 
the surface is measured. In another form of competition 
assay, marked and unmarked receptors can be exposed to the 
surface simultaneously. The amount of marker remaining on 
predefined regions of the surface can be related to the 
amount of unknown receptor in solution. Yet another form of 
competition assay will utilize two receptors having different 
labels, for example, two different chromophores. 

In other embodiments, in order to detect receptor binding, 
the double-stranded oligonucleotides which are formed with 
auached probes or with a flexible linking group will be 
treated with an intercalating dye, preferably a fluorescent 
dye. The library can be scanned to establish a background 
fluorescence. After exposure of the library to a receptor 
solution, the exposed library will be scanned or illuminated 
and examined for those areas iti which fluorescence has 
changed. Alternatively, the receptor of interest can be 
labeled with a fluorescent dye by methods known to those of 
skill in the art and incubated with the library of probes. The 
library can then be scanned or illuminated, as above, and 
examined for areas of fluorescence. 

In instances where the libraries are synthesized on beads 
in a number of containers, the beads are exposed to a 
receptor of interest. In a preferred embodiment the receptor 
is fluorescently or radioactively labelled. Thereafter, one or 
more beads are identified that exhibit significant levels of. 
for example, fluorescence using one of a variety of tech- 
niques. For example, in one embodiment, mechanical sepa- 
ration under a microscope is utilized. The identity of the 
molecule on the surface of such separated beads is then 
identified using, for example, NMR, mass spectrometry. 
PGR amplification and sequencing of the associated DNA, 
or the like. In another embodiment, automated sorting (i.e., 
fluorescence activated cell sorting) can be used to separate 
beads (bearing probes) which bind to receptors from those 
which do not bind. TVpically the beads will be labeled and 
identified by methods disclosed in Needels, et al., Proc. 
Natl Acad. Sci. USA 90:10700-10704 (1993), incorporated 
herein by reference. 

The assay methods described above for the libraries of the 
present invention will have tremendous application in such 
endeavors as DNA "footprinting" of proteins which bind 
DNA. Currently, DNA footprinting is conducted using 
DNasc I digestion of double-stranded DNA in the presence 
of a putative DNA binding protein. Gel analysis of cut and 
protected DNA fragments then provides a "footprint" of 50 
where the protein contacts the DNA. This method is both 
labor and time intensive. See. Galas et al., Nucleic Acid Res. 
5:3157 (1978). Using the above methods, a ''footprint" could 
be produced using a single array of unimolecular, double- 
stranded oligonucleotides in a fraction of the time of con- 
ventional methods. Typically, the protein will be labeled 
with a radioactive or fluorescent species and incubated with 
a library of unimolecular, double-stranded DNA. Phospho- 
rimaging or fluorescence detection will provide a footprint 
of those regions on the library where the protein has bound. 
Alteniativcly, unlabeled protein can be used. When unla- 
beled protein is used, the double-stranded oligonucleotides 
in the library will all be labeled with a marker, typically a 
fluorescent marker. Incorporation of a marker into each 
member of the library can be carried out by terminating the 
oligonucleotide synthesis with a commercially available 
fluorescing phosphoramidite nucleotide derivative. Follow- 
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ing incubation with the unlabeled protein, the.library will be 
treated with DNase 1 and examined for areas which are 
protected from cleavage. 

The assay methods described above for the libraries of the 
present invention can also be used in reverse drug discovery. 
In such an application, a compound having known pharma- 
cological safety or other desired properties (e.g., aspirin) 
could be screened against a variety of double-stranded 
oligonucleotides for potential binding. If the compound is 
shown to bind to a sequence associated with, for example, 
tumor suppression, the compound can be further examined 
for efficacy in the related diseases. 

In other embodiments, probe arrays comprising P-tum 
mimetics can be prepared and assayed for activity against a 
particular receptor, p-tum mimetics are compounds having 
molecular structures similar to P-tums which are one of the 
three major components in protein molecular architecture. 
p-tuTTis are similar in concept to hairpin turns of oligonucle- 
otide strands, and are often critical recognition features for 
various protein-ligand and protein-protein interactions. As a 
result, a library of P-tum mimetic probes can provide or 
suggest new therapeutic agents having a particular affinity 
for a receptor which will correspond to the affinity exhibited 
by the p-tum and its receptor. 
Bioelectronic Devices and Methods 

In another aspect, the present invention provides a method 
for the bioelectronic detection of sequence-specific oligo- 
nucleotide hybridization. A general method and device 
which is useful in diagnostics in which a biochemical 
species is attached to the surface of a sensor is described in 
U.S. Pat. No. 4,562,157 (the Lowe patent), incorporated 
herein by reference. The present method utilizes arrays of 
immobilized oligonucleotides (prepared, for example, using 
VLSIPS'*'^ technology) and the known photo-induced elec- 
tron transfer which is mediated by a DNA double helix 
structure. Sec, Murphy ct al.. Science 262:1025-1029 
(1993). This method is useful in hybridizationbascd diag- 
nostics, as a replacement for fluorescence-based detection 
systems. The method of bioelectronic detection also offers 
higher resolution and potentially higher sensitivity than 
earlier diagnostic methods involving sequencing/detecting 
by hybridization. As a result, this method finds applications 
in genetic mutation screening and primary sequencing of 
oligonucleotides. The method can also be used for Sequenc- 
ing By Hybridization (SBH), which is described in co- 
pending application Ser. Nos. 08/082,937 (filed Jun. 25. 
1993 now abandoned) and 08/168,904 (filed Dec. 15. 1993), 
each of which arc incorporated herein by reference for all 
purposes. This method uses a set of short oligonucleotide 
probes of defined sequence to search for complementary 
sequences on a longer target strand of DNA. The hybrid- 
ization pattern is used to reconstruct the target DNA 
sequence. Thus, the hybridization analysis of large numbers 
of probes can be used to sequence long stretches of DNA. In 
immediate applications of this hybridization methodology, a 
small number of probes can be used to interrogate local 
DNA sequence. 

In the present inventive method, hybridization is moni- 
tored using bioelectronic detection. In this method, the target 
DNA. or first oligonucleotide, is provided with an electron- 
donor tag and then incubated with an array of oligonucle- 
otide probes, each of which bears an electron-acceptor tag 
and. occupies a known position on the surface of the array. 
After hybridization of the first oligonucleotide to the array 
has occurred, the hybridized array is illuminated to induce 
an electron transfer reaction in the direction of the surface of 
the array. The electron transfer reaction is then detected at 
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the location on the surface where hybridization has taken 
place. Typically, each of the oligonucleotide probes in an 
array will have an attached electron-acceptor tag located 
near the surface of the solid support used in preparation of 
the array. In embodiments in which the arrays are prepared 5 
by light-directed methods (i.c, typically 3' to 5' direction), 
the electronacccptor tag will be located near the 3' position. 
The electron-acceptor tag can be attached either to the 3' 
monomer by methods known to those of skill in the art. or 
it can be attached to a spacing group between the 3' lo 
monomer and the solid support. Such a spacing group will 
have, in addition to functional groups for attachment to the 
solid support and the oligonucleotide, a third functional 
group for attachment of the electronacccptor tag. The target 
oligonucleotide will typically have the electron-donor tag 15 
attached at the 3' position. Alternatively, the target oligo- 
nucleotide can be incubated with the array in the absence of 
an clcctron-donor tag. Following incubation, the electron- 
donor tag can be added in solution. The clecU*on-donor lag 
will then intercalate into those regions where hybridization 20 
has occurred. An electron transfer reaction can then be 
detected in those regions having a continuous DNA double 
helix. 

The electron-donor tag can be any of a variety of com- 
plexes which participate in electron transfer reactions and 25 
which can be attached to an oligonucleotide by a means 
which docs not interfere with the electron transfer reaction. 
In preferred embodiments, the electron-donor lag is a ruthe- 
nium (II) complex, more preferably a ruthenium (II) 
(phen*)2(dppz) complex. 

The electron-acceptor tag can be any species which, with 
the clcctron-donor tag, will participate in an electron transfer 
reaction. An example of an electron-acceptor tag is a 
rhodium (III) complex. A preferred electron- acceptor tag is 
a rhodium (III) (phi)2(phen') complex. 

In a particularly preferred embodiment, the clcctron- 
donor tag is a ruthenium (II) (phen')2(dppz) complex and the 
electron- acceptor lag is a rhodium (III) (phi)2(phcn') com- 
plex. 

In still another aspect, the present invention provides a 
device for the bioclcctronic detection of sequence-specific 
oligonucleotide hybridization. The device will typically con- 
sist of a sensor having a surface to which an array of 
oligonucleotides arc attached. The oligonucleotides will be 
attached in prc-dc fined areas on the surface of the sensor and 
have an electron-acceptor tag aUachcd to each oligonucle- 
otide. The electron-acceptor tag will be a tag which is 
capable of producing an electron transfer signal upon illu- 
mination of a hybridized species, when the complementary 
oligonucleotide bears an clcctrondonaiing lag. The signal 
will be in the direction of the sensor surface and be detected 
by the sensor. 

In a preferred embodiment, the sensor surface will be a 
silicon-based surface which can sense Ihe electronic signal 
induced and, if necessary, amplify the signal. The metal 
contacts on which the probes will be synlhesized can be 
treated with an oxygen plasma prior lo synthesis of the 
probes lo enhance the silane adhesion and concentration on 
the surface. The surface will further comprise a multi-gated 
field effect transistor, with each gate serving as a sensor and 60 
different oligonucleotides attached to each gate. The oligo- 
nucleotides will typically be attached to the metal contacts 
on the sensor surface by means of a spacer group. 

The spacer group should not be too long, in order to 
ensure that the sensing function of the device is easily 65 
activated by the binding interaction and subsequent illumi- 
nation of the "tagged" hybridized oligonucleoiidcs. Prcfcr- 
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ably, the spacer group is from 3 to 1 2 atoms in length and 
will be as described above for the surface modifying portion 
of the spacer group. L*. 

The oligonucleotides which are attached to the spacer 
group can be formed by any of the solid phase techniques 
which are known lo those of skill in the art. Preferably, the 
oligonucleotides are formed one base al a time in the 
direction of the 3' lerminus to the 5' terminus by the 
*'light-direcied" methods described above. The oligonucle- 
otide can theri be modified at the 3' end to attach the 
elccu-on-acccpior tag. A number of suitable methods of 
attachment are known. For example, modification with the 
reagent Aminolink2 (from Applied Biosy stems. Inc.) pro- 
vides a terminal phosphate moiety which is derivatized with 
an aminohexyl phosphate ester. Coupling of a carboxylic 
acid, which is present on the electron-acceptor tag, to the 
amine can ihcn be carried out using HOBT and DCC. 
Altemaiively, synthesis of the oligonucleotide can begin 
with a suitably derivatized and protected monomer which 
can then be deprotccicd and coupled to the clectron-acceplor 
tag once the complete oligonucleotide has been synthesized. 

The silica surface can also be replaced by silicon niuidc 
or oxynitride. or by an oxide of another metal, especially 
aluminum, titanium (IV) or iron (III). The surface can also 
be any other film, membrane, insulator or semiconductor 
overlying the sensor which will not interfere with the 
detection of electron transfer detection and to which an 
oligonucleotide can be coupled. 

Additionally, detection devices other than an FET can be 
used. For example, sensors such as bipolar transistors, MOS 
transistors and the like are also useful for the detection of 
clecu-on Uansfcr signals. 
Adhesivcs 

In still another aspect, the present invention provides an 
adhesive comprising a pair of surfaces, each having a 
plurality of attached oligonucleotides, wherein the singlc- 
su-andcd oligonucleotides on one surface arc complementary 
to the single-stranded oligonucleotides on the other surface. 
The strength and position/orientation specificity can be 
controlled using a number of factors including the number 
and length of oligonucleotides on each surface, the degree of 
complementary, and the spatial arrangement of complemen- 
tary oligonucleotides on the surface. For example, increas- 
ing the number and length of the oligonucleotides on each 
surface will provide a stronger adhesive. Suitable lengths of 
oligonucleotides arc typically from about 10 to about 70 
nucleotides. Additionally, the surfaces of oligonucleotides 
can be prepared such that adhesion occurs in an extremely 
position-specific manner by a suitable arrangement of 
complcmcnlary oligonucleotides in a specific pattern. Small 
deviations from the optimum spatial arrangement arc ener- 
getically unfavorable as many hybridization bonds must be 
broken and are not reformed in any other relative orienta- 
tion. 

The adhesivcs of the present invention will find use in 
numerous applications. Generally, the adhesivcs are useful 
for adhering iwo surfaces to one another. More specifically, 
the adhesivcs will find application where biological com- 
patibility of the adhesive is desired. An example of a 
biological application involves use in surgical procedures 
where tissues , must be held in fixed positions during or 
following the procedure, in tiiis application, the surfaces of 
the adhesive will typically be membranes which are com- 
patible wiih the tissues to which they are attached. 

A particular advantage of the adhesivcs of the present 
invention is that when they arc formed in an orientation 
specific manner, die adhesive portions will be "self- finding," 
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that is the system will go to the thermodynamic equilibrium 
in which the two sides are matched in the predetermined, 
orientation specific manner 

EXAMPLES 
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Example 1 

This example illustrates the general synthesis of an array 
of uni molecular, double-stranded oligonucleotides on a solid 
support. 

Unimolecular double stranded DNA molecules were syn- 
thesized on a solid support using standard light-directed 
methods (VLSIPS'*" protocols). Two hexaethylene glycol 
(PEG) linkers were used to covalently attach the synthesized 
oligonucleotides to the derivatized glass surface. Synthesis 
of the first (inner) strand proceeded one nucleotide at a time 
using repeated cycles of photo-deprotection and chemical 
coupling of protected nucleotides. The nucleotides each had 
a protecting group on the base portion of the monomer as 
well as a photolabile MeNPoc protecting group on the 5' 
hydroxy I. Upon completion of the inner strand, another 
McNPoc-protected PEG linker was covalently attached to 
the 5* end of the surface-bound oligonucleotide. After addi- 
tion of the internal PEG linker, the PEG is photodeprotecled. 
and the synthesis of the second strand proceeded in the 
normal fashion. Following the synthesis cycles, the DNA 
bases were deprotected using standard protocols. The 
sequence of the second (outer) strand, being complementary 
to that of the inner strand, provided molecules with short, 
hydrogen bonded, unimolecular double-stranded structure 
as a result of the presence of the internal flexible PEG linker. 

An array of 16 different molecules were synthesized on a 
derivatized glass slide in order to determine whether short, 
unimolecular DNA structures could be formed on a surface 
and whether they could adopt structures that are recognized 
by proteins. Each of the 16 different molecular species 
occupies a different physical region on the glass surface so 
thai there is a one-to-one correspondence between molecular 
identity and physical location. The molecules are of the form 
S-P-P-C-C-A/T-A/T-A/T-AyT-G-C-P-G-C-AyT-AA'-AA'- 
Ayr-G-G-F 

where S is the solid surface having silyl groups, Pisa PEG 
linker. A, C, G, and T are the DNA nucleotides, and F is a 
fluorescent lag. The DNA sequence is listed from the 3' to 
the 5' end (the 3' end of the DNA molecule is attached to the 
solid surface via a silyl group and 2 PEG linkers). The 
sixteen molecules synthesized on the solid support differed 
in the various permutations of A and T in the above formula. 

Example 2 

This example illustrates the ability of a library of surface- 
bound, unimolecular, double-stranded oligonucleotides to 
exist in duplex form and to be recognized and bound by a 55 
protein. 

A library of 16 different members was prepared as 
described in Example 1. The 16 molecules all have the same 
composition (same number of As, Cs, Gs and Ts), but the 
order is different. Four of the molecules have an outer su-and 
that is \Q0% complementary to the inner strand (these 
molecules will be referred to as DS. doublestranded, below). 
One of the four DS oligonucleotides has a sequence that is 
recognized by the restriction enzyme EcoRl. If the molecule 
can loop back and form a DNA duplex, it should be 
recognized and cut by the resuiclion enzyme, thereby releas- 
ing the fluorescent tag. Thus, the action of the enzyme 
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provided a functional test for DNA structure, and also served 
to demonstrate that these structures can be recognized at the 
surface by proteins. The remaining 12 molecules had outer 
strands that were not complementary to their irmer strands 
(referred to as SS, single-stranded, below). Of these, three 
had an outer strand and three had an inner strand whose 
sequence was an EcoRl half-site (the sequence on one 
strand was correct for the enzyme, but the other half was 
not). T^e solid support with an array of molecules on the 
surface is referred to as a "chip" for the purposes of the 
following discussion. The presence of fluorescently labelled 
molecules on the chip was detected using confocal fluores- 
cence microscopy. The action of various enzymes was 
determined by monitoring the change in the amount of 
fluorescence from die molecules on the chip surface (e.g. 
"reading" the chip) upon treatment with enzymes that can 
cut the DNA and release the fluorescent tag at the 5' end. 

The three different enzymes used to characterize the 
structure of the molecules on the chip were: 

1) Mung Bean Nuclease — sequence independent, single- 
strand specific DNA endonuclease; 

2) DNase I — sequence independent, double-strand spe- 
cific endonuclease; 

3) EcoRl— restriction endonuclease that recognizes the 
sequence (5-3*) 

GAATTC in double stranded DNA. and cuts between the 
G and the first A. Mung Bean Nuclease and EcoRl were 
obtained from New England Biolabs, and DNase I was 
obtained from Boehringer Mannheim. All enzymes were 
used at a concenU-ation of 200 units per mL in the buffer 
recommended by the manufacturer. The enzymatic reactions 
were performed in a 1 mL flow cell at 22° C, and were 
typically allowed to proceed for 90 minutes. 

Upon treatment of the chip with the enzyme EcoRl, the 
fluorescence signal in the DS EcoRl region and the 3 SS 
regions with the EcoRl half-site on the outer strand was 
reduced by about 10% of its initial value. This reduction was 
al least 5 times greater than for the other regions of the chip, 
indicating that the action of the enzyme is sequence specific 
on the chip. It was not possible to determine if the factor is 
greater than 5 in these preliminary experiments because of 
uncertainty in the constancy of the fluorescence background. 
However, because ihc purpose of these early experiments 
was to determine whether unimolecular double-stranded 
structures could be formed and whether they could be 
specifically recognized by proteins (and not to provide a 
quantitative measure of enzyme specificity), qualitative dif- 
ferences between the different synthesis regions were suf- 
ficient. 

The reduction in signal in the 3 SS regions with the EcoRl 
half-site on the outer strand indicated either that the enzyme 
cuts single-stranded DNA with a particular sequence, or that 
these molecules formed a double-stranded structure that was 
recognized by the enzyme. The molecules on the chip 
surface were at a relatively high density, with an average 
spacing of approximately 100 angstroms. Thus, it was 
possible for the outer strand of one molecule to form a 
double-stranded structure with the outer strand of a neigh- 
boring molecule. In the case of the 3 SS regions with the 
EcoRl half-site on the outer suand, such a bimolccular 
double-sU-anded region would have the correct sequence and 
structure to be recognized by EcoRl. However, it would 
differ from the unimolecular double-stranded molecules in 
that the inner strand remains single-stranded and thus ame- 
nable to cleavage by a single-surand specific endonuclease 
such as Mung Bean Nuclease. Therefore, it was possible to 
distinguish unimolecular from bimolecular double-suanded 
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DNA molecules on ihc surface by their ability lo be cut by 
single and double-strand specific cndonucleases. 

In order to remove all molecules that have single-stranded 
structures and lo identify unimolecular double-stranded 
molecules, the chip was first exhaustively treated with Mung 5 
Bean Nuclease. The reduction in the fluorescence signal was 
greater by about a factor of 2 for the SS regions of the chip, 
including those with the EcoRl half-site on the outer strand 
that were cleaved by EcoRl, than for the 4 DS regions. 
Following Mung Bean Nuclease treatment, the chip was 10 
treated with either DNase I (which cuts all remaining 
double-stranded molecules) or EcoRl (which should cut 
only the remaining double-stranded molecules with the 
correct sequence). Upon treatment with DNase I. the fluo- 
rescence signal in the 4 DS regions was reduced by at least 15 
5-fold more than the signal in the SS regions. Upon EcoRl 
treatment, the signal in the single DS region with the correct 
EcoRl sequence was reduced by at least a factor of 3 more 
than the signal in any other region on the chip. Taken 
together, these results indicated that the surface-bound mol- 20 
ccules synthesized with two complementary surands sepa- 
rated by a flexible PEG linker form intramolecular double- 
stranded structures that were resistant lo a single-strand 
specific cndonuclease and were recognized by both a 
doublc-slrand specific cndonuclease, and a sequence-spe- 25 
cific restriction enzyme. 
What is claimed is: 

1. A synthetic unimolecular, double-stranded oligonucle- 
otide library comprising a plurality of different members, 
each member having the formula: 
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Y_L>-X»-L'— 

wherein, 
Y is a solid support; 

X* and are a pair of complementary oligonucleotides; 

LMs a spacer; 

is a linking group having sufficient length such that X* 
and X^ form a double-stranded oligonucleotide. 

2. A library in accordance with claim 1, wherein is a 
polyethylene glycol group. 

3. A library in accordance with claim 1, wherein X* and 
X^ are complementary oligonucleotides each comprising of 
from 6 to 30 nucleic acid monomers. 

4. A library in accordance with claim 1, wherein said solid 
support is a silica support and L* comprises an aminoalkyl- 
silanc and from 1 to 4 hexaethyieneglycols. 

5. A library in accordance with claim 1, wherein said solid 
support is a silica support, comprises an aminoalkylsilane 
and from 1 lo 4 hexaethyieneglycols, is a polyethyleneg- 
lycol group and X* and X^ arc complementary oligonucle- 
otides each comprising of from 6 to 30 nucleic acid mono- 
mers. 

6. A synthetic unimolecular, double-stranded oligonucle- 
otide library of claim 1, wherein a portion of said double- 
stranded oligonucleotides formed by X^ and X^ further 
comprise a loop. 

:4c * * * * 



